Intergenic and Genic Sequence Lengths Have Opposite Relationships with Respect to Gene Expression
نویسندگان
چکیده
Eukaryotic genomes are mostly composed of noncoding DNA whose role is still poorly understood. Studies in several organisms have shown correlations between the length of the intergenic and genic sequences of a gene and the expression of its corresponding mRNA transcript. Some studies have found a positive relationship between intergenic sequence length and expression diversity between tissues, and concluded that genes under greater regulatory control require more regulatory information in their intergenic sequences. Other reports found a negative relationship between expression level and gene length and the interpretation was that there is selection pressure for highly expressed genes to remain small. However, a correlation between gene sequence length and expression diversity, opposite to that observed for intergenic sequences, has also been reported, and to date there is no testable explanation for this observation. To shed light on these varied and sometimes conflicting results, we performed a thorough study of the relationships between sequence length and gene expression using cell-type (tissue) specific microarray data in Arabidopsis thaliana. We measured median gene expression across tissues (expression level), expression variability between tissues (expression pattern uniformity), and expression variability between replicates (expression noise). We found that intergenic (upstream and downstream) and genic (coding and noncoding) sequences have generally opposite relationships with respect to expression, whether it is tissue variability, median, or expression noise. To explain these results we propose a model, in which the lengths of the intergenic and genic sequences have opposite effects on the ability of the transcribed region of the gene to be epigenetically regulated for differential expression. These findings could shed light on the role and influence of noncoding sequences on gene expression.
منابع مشابه
Distinctive sequence features in protein coding genic non-coding, and intergenic human DNA.
We have studied the behavior of a number of sequence statistics, mostly indicative of protein coding function, in a large set of human clone sequences randomly selected in the course of genome mapping (randomly selected clone sequences), and compared this with the behavior in known sequences containing genes (which we term genic sequences). As expected, given the higher coding density of the ge...
متن کاملبررسی خصوصیات فیلوژنتیکی جمعیت های زنبور عسل ایرانی (Apis mellifera meda) با استفاده از ژن ND2 میتوکندریایی
For the identification of phylogenetic characteristics of honeybee populations, sampling was conducted from 31 provinces of Iran in spring and summer 2016. Phylogenetic characteristics were evaluated based on mitochondrial ND2 gene. The intergenic regions between ND2 and COI genes were compared in different populations of honeybees. After sequencing and alignment of the genes, the relationships...
متن کاملSimple sequence repeats in organellar genomes of rice: frequency and distribution in genic and intergenic regions
MOTIVATION Simple sequence repeats (SSRs) are abundant across genomes. However, the significance of SSRs in organellar genomes of rice has not been completely understood. The availability of organellar genome sequences allows us to understand the organization of SSRs in their genic and intergenic regions. RESULTS We have analyzed SSRs in mitochondrial and chloroplast genomes of rice. We ident...
متن کاملBovine ncRNAs Are Abundant, Primarily Intergenic, Conserved and Associated with Regulatory Genes
It is apparent that non-coding transcripts are a common feature of higher organisms and encode uncharacterized layers of genetic regulation and information. We used public bovine EST data from many developmental stages and tissues, and developed a pipeline for the genome wide identification and annotation of non-coding RNAs (ncRNAs). We have predicted 23,060 bovine ncRNAs, 99% of which are un-a...
متن کاملChromatin state analysis of the barley epigenome reveals a higher‐order structure defined by H3K27me1 and H3K27me3 abundance
Combinations of histones carrying different covalent modifications are a major component of epigenetic variation. We have mapped nine modified histones in the barley seedling epigenome by chromatin immunoprecipitation next-generation sequencing (ChIP-seq). The chromosomal distributions of the modifications group them into four different classes, and members of a given class also tend to coincid...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PLoS ONE
دوره 3 شماره
صفحات -
تاریخ انتشار 2008